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Abstract: New direct acting antivirals (DAAs) suchi as daclatasvir (DCV; BI\/IS-790052), whicti target 
NS5A function witli picomolar potency, are showing promise in clinical trials. The exact nature of 
how these compounds have an inhibitory effect on HCV is unknown; however, major resistance 
mutations appear in the N-terminal region of NS5A that include the amphipathic helix and domain 
1. The dimeric symmetry of these compounds suggests that they act on a dimer of NS5A, which is 
also consistent with the presence of dimers in crystals of NS5A domain 1 from genotype 1b. Geno- 
type 1a HCV is less potently affected by these compounds and resistance mutations have a greater 
effect than in the lb genotypes. We have obtained crystals of domain 1 of the important la NS5A 
homologue and intriguingly, our X-ray crystal structure reveals two new dimeric forms of this 
domain. Furthermore, the high solvent content (75%) makes it ideal for ligand-soaking. Daclatasvir 
(DCV) shows twofold symmetry suggesting NS5A dimers may be of physiological importance and 
serve as potential binding sites for DCV. These dimers also allow for new conformations of a NS5A 
expansive network which could explain its operation on the membranous web. Additionally, sul- 
fates bound in the crystal structure may provide evidence for the previously proposed RNA binding 
groove, or explain regulation of NS5A domain 2 and 3 function and phosphorylation, by domain 1. 
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Introduction 

Hepatitis C Virus (HCV) is expected to be an 
increasing global healthcare concern.^ HCV is a 
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major cause of chronic hepatitis, liver cirrhosis and 
hepatocellular carcinoma (HCC) and is estimated to 
infect up to 3% of people worldwide.^ No protective 
vaccine currently exists against HCV and the most 
common combination therapy based on pegylated- 
interferon ol2 and ribavirin are effective in only a 
fraction of patients and these are plagued by 
adverse effects,^ even in combination with new 
drugs such as telaprevir (Vertex Pharmaceuticals) 
and boceprevir (Merck & Co.).*''^ The HCV species IS 
highly genetically diverse and classified into six 
genotypes (1-6) with more than 90 different sub- 
types. Genotypes la and lb are the most common, 
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accounting for about 60% of global infections.® The 
distribution of HCV genotypes varies globally and 
this heterogeneity affects virus pathogenicity and 
their responsiveness to therapies (WHO, 2003). 
Hence, current efforts being pursued in the treat- 
ment of HCV are focused on the development of 
direct acting antivirals (DAAs).' The HCV genome 
encodes 10 proteins-consisting of the structural pro- 
teins; core, El glycoprotein and E2; and the non- 
structural viral proteins p7, NS2, NS3, NS4A, 
NS4B, NS5A, and NS5B.^ All of the HCV proteins 
are essential for efficient HCV propagation and are 
therefore all potential targets for antiviral com- 
pounds. The recently approved NS3-4A protease 
inhibitors, telaprevir and boceprevir, used in combi- 
nation with the established interferon and ribavirin 
therapy have created a new standard of care for 
patients with genotype 1 infections.^ In addition, 
DAAs showing high potency against the functionally 
well-characterized NS5B polymerase are already 
showing promise in phase III clinical trials,^" while 
HCV proteins p7, NS4B, and NS5A are currently 
under investigation as therapeutic targets. The 
continued development of combination therapies 
with more effective and tolerable DAAs that also 
allow the elimination of interferon-based regimes 
will be fundamental in the future successful treat- 
ment of HCV infection. 

NS5A is a 447 amino acid phosphoprotein of 56/ 
58 kDa implicated in a variety of roles from the rep- 
lication of HCV RNA, the modulation of the host cell 
responses and the assembly of viral particles. 
Despite this critical and expansive role in the virus 
life cycle, no specific mechanism is yet ascribed to 
NS5A. Domain 1 of NS5A consists of a zinc-binding 
motif^® and an N-terminal amphipathic helix which 
is necessary and sufficient for association of NS5A 
with the ER membrane^'^ and the structure is essen- 
tial for HCV genome replication.^® Domains 2 and 3 
appear to form an intrinsically unstructured region 
of the protein with roles in RNA replication and the 
assembly of hepatitis C virus particles. 

The pivotal role of NS5A in virus replication 
makes it an ideal target for inhibition. Recent high- 
throughput screens (HTS) for inhibitors of HCV rep- 
licon propagation yielded leads that were optimized 
into compounds with picomolar inhibitory potency 
Fig. 1. To date, no exact binding data have been vali- 
dated, although resistant mutant analysis has 
revealed several mutations within domain 1 of the 
NS5A gene that confer sensitivity to inhibitors'^ and 
mono- and bis-crosslinking of a photoaffinity probe 
with the N-terminal region of NS5A'' has been dem- 
onstrated. An example of such a compound is dacla- 
tasvir (DCV, Bristol-Myers Squibb, Wallingford, CT), 
which is the representative of a mechanistic class 
that exhibits picomolar half maximal effective con- 
centrations (EC50) towards HCV replicon cell cul- 
tures'^ with the twofold symmetry plajdng an 
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Figure 1. Chemical structure of the family of NS5A-targetlng 
molecules. 

important role in its effectiveness, particularly with 
respect to pan-genotype inhibition. More specifically, 
la genotypes show lower sensitivity (approximately 
5- to 10-fold lower) than the lb genotypes with sym- 
metrical or pseudo-symmetrical compounds and 
resistance mutations generated in genotype la repli- 
cons result in a greater increase in resistance (shifts 
in EC50 values approximately 2 magnitudes greater) 
than those conferred from similar mutations in 
genotype lb.'* However, smaller, monomeric com- 
pounds can exhibit potent activity towards genotype 
lb replicons but a much reduced potency towards 
genotype la, initially viewed as a hallmark of the 
NS5A inhibitor class."^ 

Crystal structures of genotype lb NS5A domain 
1 (NS5A-D1) have revealed the formation of dimers 
during crystallization. Although the formation of 
multimers or protein-protein contacts are inevitable 
consequences of protein crystallization,'® there is 
some in vitro evidence of dimerization'^ and the 
phenomenal potency of this class of dimeric com- 
pounds has led to speculation that they act on a 
physiologically-relevant dimer present in NS5A-D1 
crystal structures. While unproductive binding can 
occur," productive binding, hypothetically, over- 
stabilizes NS5A dimerization'^ potentially in the 
form of an immature HCV polyprotein dimer, or 
higher order multimer, to slow the cleavage of the 
NS4B-NS5A precursor.'® This hypothesis is sup- 
ported by the bis-crosslinking of NS5A by BMS-351, 

Crystal structure of HCV genotype la NS5A domain 1 



an inhibitor of HCV replication that incorporates 
photo-activated functionahty^^ and the dramatic 
improvement in potency obtained with symmetrical 
and pseudo-symmetrical inhibitors relative to half 
molecules, particularly with genotype la.^®'^° The 
first crystal structure of a NS5A-D1 construct con- 
sisting of residues 25 to 215 from genotype lb sub- 
type Conl, revealed it to consist of two subdomains 
and to contain a novel fold with a stabilising disul- 
phide bond. Additionally, the asymmetric unit com- 
prised a dimer with an intermolecular interface of 
approximately 830 A^, which forms a large groove 
that could accommodate viral RNA;^^ this dimer has 
served as the basis for the working model of the 
NS5A-D1. Crystallization of a NS5A-D1 construct 
from the same genotype and subtype containing resi- 
dues 33 to 202 resulted in higher resolution struc- 
tures and showed that the NS5A-D1 formed an 
alternative dimer with a 910 A^ dimeric interface. 
Through combining the two dimers, it was hypothe- 
sized how a NS5A superhelical array could form.^^ 
The first crystal form does not seem to be readily 
reproducible and soaks of the second crystal form 
with compounds has yet to yield structures of 
ligand-bound complexes (unpublished data). 

Neither crystallographic nor spectroscopic evi- 
dence for compounds such as DCV binding to NS5A- 
Dl is currently available although binding of a 
structurally related affinity probe to genotype lb 
NS5A has been demonstrated.^^ To address this and 
investigate the differences between the compound's 
effect on genotypes lb and la, we decided to deter- 
mine the structure of NS5A-D1 from genotype la. 
Our crystals of domain 1 from genotype la NS5A 
show a new crystal form, and allowed structural 
determination via X-ray crystallography to a resolu- 
tion of 3.5 A. Intriguingly, our structure shows two 
new dimeric forms of this domain. NS5A-D1 dimers 
allow the formation of new hypothetical multimer 
assemblies, which form an expansive network that 
could operate in the membranous web. Experimental 
observation of RNA binding^^'^^'^^ support a hypoth- 
esis that sulfates from the crystallization buffer 
bound in the crystal structure may mimic a RNA 
backbone and highlight the location of a RNA bind- 
ing groove. Alternatively, this region of NS5A may 
bind phosphorylated sites of NS5A domain 2 and 3 
to alter its overall conformation and regulate pro- 
gression from RNA synthesis to viral assembly: this 
may explain how DCV expresses two MOAs,*^ affect- 
ing both viral replication and assembly, and blocking 
hyperphosphorylation of NS5A. 

Results 

Crystal structure of genotype la NS5A-D1 

Our crystals of genotype la NS5A-D1 represent a 
new crystal form. The crystal structure consists of 



four monomers of the genotype la NS5A-D1 
arranged into T-shaped asymmetric units (Fig. 2 and 
Supporting Information Fig. SI), which are juxta- 
posed to form large solvent channels; the crystals 
have a Matthews coefficient of 4.73 A^/Dalton and a 
solvent content of 73.97%.^'' Similar surfaces of the 
molecule are exposed to the solvent as in the most 
recent lb NS5A Domain 1 dimer crystal structure. 

The overall fold of each monomer in our model 
superposes well with the NS5A domain structures 
from genotype Ib^^'^^ with an average Ca RMSD of 
0.57 A between la and lb monomers. The zinc- 
binding motif has the same coordination as the 
genotype lb NS5A Domain 1 with an average dis- 
tance of 2.24 A between coordinating sulphurs and 
the zinc atom. These bond lengths are slightly less 
than the ideal distance (2.35 ± 0.09 A) for structural 
metal zinc-coordination sites and slightly more than 
the ideal distance (2.08 ± 0.13 A) for catalytic metal 
zinc-coordination sites. The arrangement and prox- 
imity of residues Cys 142 and Cys 190 in our model 
and the presence of electron density in between the 
residues suggests the formation of a disulphide 
bridge, despite the presence of DTT in the crystalli- 
zation buffer, with a distance between the sulfur 
atoms of 2.0 A, 2.0 A, 2.0 A, and 2.5 A in monomers 
A, B, C, and D, respectively. It is not known whether 
this disulphide bond is physiologically important or 
essential for HCV replication.^® 

Amino acids present in the construct that could 
not be fitted to any electron density include the dis- 
ordered N- and C-termini of three to four amino 
acids and 13 to 16 amino acids, respectively, depend- 
ing on the monomer. Loops between fj-strands B7 
and B8, and p-strands B8 and B9 could not be fully 
modeled in monomer A, in which modeling of resi- 
due Glu 172, and residues Gly 178 and Leu 179, 
respectively, had to be omitted. 

Situated in between monomers A, C and D, 
there was a large unmodeled region in the electron 
density difference map appearing to be coordinated 
by nitrogens e from residues Arg41 and Arg81 from 
monomer A and by a nitrogen t| from Arg78 from 
monomer B. Owing to the presence of these 
positively-charged side-chains and the high concen- 
tration of sulfate ions in the crystallization solution, 
it was modeled as S04^~ (Supporting Information 
Fig. S2A). Arg41 and Arg 78 are strictly conserved 
as positive amino acids (>90% Arg or Lys at position 
41 and >95% Arg or Lys at position 78), whereas 
Arg81 shows no conservation (Arg, Ser, Ala, Met, or 
Leu substitutions at position 81). Although the elec- 
tron density matched that expected for a sulfate ion, 
we cannot rule out that this site is occupied by 
another anion. Another region of electron density 
was identified adjacent to monomer A but was not 
modeled as it was less well defined. Nevertheless, it 
shows potential coordination by Arg 44 from chain C 
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This figure also includes an IMolecules 3D interactive version that can be accessed via the link at the bottom of this figure's caption. 
A C 






Figure 2. Genotype 1b and la NS5A domain 1 crystal structure dimers in ribbon representation. The side chain of tyrosine 93, 
a major resistant mutation, is displayed as spheres. Monomers are shown in orange and green, and coloured in pastel for geno- 
type lb dimers. (A) The first crystal structure dimer^^ forms a potential RNA-binding pocket. (B) The second genotype lb 
dimer^^ are juxtaposed in parallel to form an extensive interface. (C) Monomers A and B from the genotype 1a NS5A domain 1 
crystal structure share the same interface as the Love ef a/, dimer shown in B albeit with the monomers arranged in an antipar- 
allel fashion. Use this link to access the interactive version of this figure. (D) Monomers C and D form a N-terminal, head to 
head dimer. Use this link to access the interactive version of this figure. 



and the backbones of Gly 45 and Met 72 from chain 
A and we tentatively propose that this region 
includes an additional sulfate ion (Supporting Infor- 
mation Fig. S2B). 

Dimer interfaces 

Two new dimeric forms of NS5A-D1 are observed 
from the crystal packing shown in the asymmetric 
unit. Monomers A and B of our structure are related 
by a twofold axis (179.9°) perpendicular to their 



length. Dimer AB displays a similar interface to 
that previously reported,^^ but one of the monomer's 
orientation is reversed creating a head-to-tail confor- 
mation [Fig. 3(A)], rather than head-to-head. Dimer 
AB has a smaller buried solvent-accessible surface 
area of 435.1 A^ than that calculated for the Love 
et al^^ dimer (957.1 A^), due to the more detached 
packing of the two molecules — the distance between 
the centers of mass of the monomers in dimer AB is 
23.1 A compared to 21.4 A in the Love et al. dimer 




Figure 3. (A) Monomers A (orange) and B (green) make up dimer AB. (B) A secondary structure alignment of monomer A from 
our genotype la crystal structure with monomer B from the Love ef al. dimer (blue). Dimer AB shares a similar dimer contact 
face to the Love et al. dimer except it is less closely packed and in a head to tail conformation. 
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Figure 4. (A) Monomers C (orange) and D (green) make up dimer CD. (B) A secondary structure alignment of monomer A from 
tlie Love ef a/, dimer (blue) with eacli monomer C and D shows an extension of p-strand B3 from Val 75 to Arg 78. (C) Hydro- 
gen bonds are formed between residues lie 74 O and Arg 78 NH, Gly 76 NH and Gly 76 O, forming a total of four hydrogen 
bonds in the CD dimer. 



[Fig. 3(B)]. The contact surfaces are formed by resi- 
dues 97-99, 112-115, 149, 160-161. Amino acid resi- 
dues Pro 97, Cys 98, and Thr 99 show low 
conservation and are in contact with the conserved 
residue Phe 120 of the other dimer. Conserved 
amino acid residues Arg 122, Val 123, Ala 125, and 
the non-conserved Ser 124 form contacts with con- 
served residues Arg 160 and Phe 161 (which is Tyr 
or His in other genotypes). All of these 10 residues 
are also included in the contact area of the Love 
et al. dimer, whose contact area also includes an 
additional 14 residues. Pairs of residues forming 
notable interactions reported in the Love et al. 
dimer, such as the salt bridge formed by conserved 
residues Glu 148-Arg 112, the backbone hydrogen- 
bond between conserved residues Ala 92 and Gly 96, 
and an electrostatic interaction between residues 
Arg 48 and Glu 116, are not close enough to form in 
dimer AB. 

Monomers C and D form a second dimer CD 
that shows an approximately twofold (173.6°) sym- 
metry axis, composed of the monomers aligned in a 
head to head conformation [Fig. 4(A)]. The mono- 
mers form a buried solvent-accessible surface area of 
315. OA^. The contact surfaces are formed by residues 
74 to 78 and 83 to 84. Residues 74 to 78 from each 



molecule appear to form an intermolecular anti- 
parallel p-sheet through an extension of p-strand B3 
from Val 75 to Arg 78 [Fig. 4(B)]. The formation of 
this intermolecular p-sheet does not disrupt the p- 
sheets within each molecule. All the residues 
involved in the intermolecular p-sheet lie 74 to Arg 
78 are conserved. Residues 83 to 84 from each mole- 
cule may stabilize the dimer through additional van 
der Waals interactions. Met 83 shows conservation 
as a hydrophobic residue (>99% residues are Met, 
Thr, Val, or He) while Trp 84 is highly conserved 
(>95% Trp). In forming the intermolecular p-sheet, 
the refined structure shows intermolecular backbone 
hydrogen bonds between pairs of residues He 74 O 
with Arg 78 NH, and Gly 76 NH with Gly 76 0. 
Copies of dimer CD are packed together in the crys- 
tal with the same interface observed in dimer AB. 

Simulation of the NS5a CD-dimer/DCV complex 

The physical chemical properties and dimensions of 
the cleft between the head-to-head CD-dimer match 
well with that of the DCV molecule. Although crys- 
tal soaking experiments with DCV failed, this is not 
surprising as the amphipathic helix and its linker is 
absent in our structure. The high frequency of 
escape mutants occurring in this region suggests an 
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Figure 5. DCV docked into the cleft between the CD dimer. The surface of the dimer is coloured by its electrostatic potential 
(red is negative, gray is neutral, and blue is positive). The N-terminal residues are shown in ribbon (C-domain in green, D- 
domain is orange). Key resistant positions are shown with van der Waals surface (carbon: gray, nitrogen: blue, oxygen: red, sul- 
fur: yellow, polar hydrogen: white). DCV is shown in a ball and stick depiction (carbon: aquamarine). A) Top view of DCV 
docked symmetrically through the cleft. B) Side view focuses on the cap of DCV docked into the cleft. Yellow dotted lines indi- 
cate the hydrogen bond between the DCV valine carbonyl and the NS5A Glu62 backbone amide NH. Note: L31 packs against 
the cap valine moiety of DCV. C) Communication network between DCV and Tyr93. The ribbon depiction of the CD monomer is 
shown in green and orange, respectively. The van der Waals surface of the DCV (carbon: aquamarine) and the Glu62, Arg56, 
and Tyr93 (carbon: gray) communication network. Use this link to access the interactive version of this figure. 



active role drug binding. We therefore performed 
molecular dynamic simulations to add the experi- 
mental NMR structure of the N-terminal helix to 
our structure and model DCV into the interdimer 
cleft. In the final modeled structure the central floor 
region of the cleft is composed of hydrophobic resi- 
dues Val75, Gly76, and Pro77 which lay under the 
biphenyl core and between the caps of DCV while 
the periphery of the putative binding site contains 
more polar residues including His54, Arg56, Glu62, 
His66, Arg73, and Thr79 which are proximal to the 
cap regions of DCV (Fig. 5). The only hydrogen- 
bonds between DCV and the NS5A CD-dimer 
observed in the docked model are between the back- 
bone NH's of Glu62 and the two DCV valine carbon- 



yls (Fig. 5). To further evaluate the putative binding 
model, a 40 ns MD simulation was conducted. The 
model drifted slightly from the starting structure 
over the heating and equilibration phase and then 
stabilized (Fig. 5). Each monomer and DCV 
remained stable over the production phase; however, 
the RMSD for the CD-dimer shows a slow oscillation 
as the structure fluctuates about the long symmetry 
axis (see movie, supplemental material). The mean 
distance between DCV and the amino acids that line 
the putative binding cleft within the CD-dimer are 
shown in Supporting Information Table I. The two 
hydrogen-bonds observed between DCV and the 
dimer did not remain constant but periodically 
formed and broke throughout the simulation. 
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Table I. Crystal Data Collection and Refinement 
Statistics 



Crystal parameters 




Space group 


P2i2i2i 


Cell dimensions 


a = 100.25 A; 




b = 101.53 A; 




c = 148.70 A 


Molecules per asymmetric unit 


4 


Data collection 




Beamline 


DLS 124 


Wavelength (A) 


0.97780 


Resolution range (A) 


51.475-3.5 (3.67-3.5) 


Unique observations" 


17168 (1226) 


I^merge 


0.135 (0.388) 


<I>/aP 


3.5 (1.9) 


Completeness (%)" 


86.0 (87.5) 


Multiplicity" 


3.4 (3.3) 


Refinement 




■Rwork/^free(%) 


22.49/26.64 


No. protein residues in monomer 


159 


RMSD stereochemistry 




Bond lengths (A) 


0.0147 (monomer A) 


Bond angles (°) 


2.453 (monomer A) 


RMSD of monomers 




B to A (A) 


0.412 


C to A (A) 


0.454 


D to A (A) 


0.622 


PDB accession code 


4CL1 



" PDB, Protein Data Bank. Values reported in the format: 
overall data (last resolution shell). RMSD (root-mean- 
square deviation) stereochemistry is the deviation from 
ideal values. 



The most frequently observed, clinically- 
relevant NS5A genotype la (GT-la) resistance muta- 
tions have been identified at amino acids 28, 30, 31, 
and 93 and the linked positions Q30R-E62D, 
whereas for GT-lb, the major resistant changes 
occurred at residues 31 and 93.^^ The key resistance 
mutation that is present in our structure is Tyr93. 
Although this residue does not reside directly within 
the putative binding site, it is connected to DCV via 
a communication network in the MD simulation. 
During the course of the simulation, DCV intermit- 
tently hydrogen-bonds with Glu62 via the backbone 
amide NH to the DCV valine carbonyl or the Glu62 
side-chain and the DCV imidazole NH; Glu62, in 
turn, periodically forms a salt bridge with Arg56 
which packs against Tyr93 to complete the network. 
This allosteric mechanism of resistance is supported 
by the recent demonstration that resistant NS5A 
does not necessarily eject an inhibitor and that bind- 
ing of an inhibitor to NS5A alone is not sufficient for 
inhibition. It is worth noting that mutations are 
often to smaller or more flexible residues, which 
may allow NS5A to recover its ability to articulate 
in the presence of an inhibitor leading to a more 
flexible dimer interface or allowing NS5A to popu- 
late other dimer structures that are required for 
function. Interestingly, the communication network 
between Tyr93 and DCV also contains Glu62, a posi- 
tion that has been associated with resistance. How- 



ever, Glu62 resistance normally arises when linked 
with the primary Q30R mutation.^® 

Discussion 

It has been established that dimeric inhibitors, 
either symmetrical or pseudo-symmetrical, are a key 
factor in the antiviral activity in their compounds 
such as DCV, which shows 50 and 9 pM EC50 values 
against la and lb HCV replicons, respectively. Key 
resistance mutations are generated at Leu 31 and 
Tyr93 in both genotypes and additionally at Met28 
and Gln30 in genotype la.^^ Similar resistance pro- 
files are generated in response to treatment with 
structurally related compounds including BMS-766 
(Bristol-Myers Squibb), Ledipasvir/GS-5885 (Gilead 
Sciences), GSK2336805 (GlaxoSmithKline), PPI-461 
(Presidio, structure not disclosed) as well as 
AZD7295 (AstraZeneca/Arrow Therapeutics) which 
represents a distinct chemotype that only weakly 
inhibits genotype la HCV (Fig. 1). The crystal struc- 
ture the genotype la NS5A reveals that these crys- 
tals have very high solvent content. Our new crystal 
form is therefore likely to be amenable to soaks with 
large compounds and would serve as an additional 
tool in the development of novel NS5A-targeted 
therapeutics. 

The current working hypothesis is that such 
compounds act on an NS5A dimer, based on the 
dimeric nature of DCV analogs, the location of NS5A 
resistance mutations, and mono- and bis- 
crosslinking data, albeit with a lack of crystallo- 
graphic or spectroscopic evidence. Publication of the 
crystal structure for two genotype lb dimers^^'^^ has 
caused debate as to which dimer is biologically rele- 
vant or whether several dimer forms are of impor- 
tance. Our new results provide further candidates 
for physiologically-relevant NS5A dimers. Due to the 
dimensions of the compounds targeting NS5A, we 
deem that the head to tail conformation of dimer AB 
places the two N-termini too far apart, although this 
does not rule out another physiological role for 
dimer AB. The monomers of dimer CD have adjacent 
N-termini and this dimer could represent a potential 
target of the compound. Although dimer CD has a 
relatively small buried solvent accessible surface 
area of 315.0 A^, the inclusion of the N-terminal 
amphipathic helix in vivo may further stabilize its 
formation. It is conceivable that DCV may disrupt 
the HCV life cycle by stabilizing the formation of a 
weak dimer such as the dimer CD. 

Through the use of molecular docking and 
molecular dynamics we have shown that the binding 
cleft between the head-to-head CD-dimer could 
accommodate DCV in a fashion such that the key 
resistance mutations lie in proximal to the inhibitor. 
This demonstrates the CD-dimer could serve as a 
potential binding site for DCV in addition to the pre- 
vious models which show DCV could interact with 
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Figure 6. Viral RNA approximately modeled into NS5A 
domain 1 dimer. A structural alignment of monomer A and its 
associated sulfates from our genotype 1a crystal structure 
with each monomer in the Tellinghuisen ef a/, dimer places 
the sulfates in the dimer's RNA-binding groove. The sulfates 
may be mimicking an RNA phosphate backbone allowing the 
modeling of viral RNA into the dimer. Use this link to access 
the interactive version of this figure. 



genotype lb dimers. Specifically, the two muta- 
tions observed in our structure (Y93H and E62D) 
are within two amino acid layers of DCV and estab- 
lish a communication network through R56. How- 
ever, based on the relative strength of the Y93H and 
E62D resistance that is observed, we cannot rule out 
that other binding modes are possible as DCV has 
been shown to complement the symmetry of the 
previous NS5A dimers close to primary clinical 
resistance sites. Binding poses of BMS-411with the 
face-to-face NS5A domain I dimer structure from 
Telhnghuisen et al.^^ and GSK2236805 with the 
back-to-back Love et al. structure^^ have been previ- 
ously proposed. As with our dimer model 
reported herein, the amino-terminal amphipathic a- 
helix was modeled using the NMR structure by 
Penin et al.^^ All the models suggest a symmetric 
binding mode along the dimer interface where inhib- 
itors can interact with regions proximal to resistance 
mutations at positions at 28, 30, 31, 54, and 93 and 
genotype la/lb changes at positions 28, 30, 54, 56, 
58, and 62. While the Love et al. dimer provides a 
cleft for binding with good shape complimentarily to 



the NS5A inhibitors, the chemical nature of the cleft 
is reversed from that of the inhibitors. Specifically, 
the hydrophobic biphenyl core of the inhibitor is 
positioned over the two adjacent T95 amino acid res- 
idues at the center of the dimer interface and the 
polar imidazole moieties sit over the hydrophobic 
aromatic ring of the Y93 side chains. A better match 
is seen in the EMS model where the inhibitor core 
lies over the two adjacent F37 residues at the dimer 
interface and the imidazoles are positioned for 
hydrogen bonding with Q62 from each monomer and 
the N-terminus is placed adjacent to the inhibitor. 
However, it should be noted that, unlike our dimer, 
this crystal structure does not present a cleft for 
inhibitor binding in the absence of the modeled in 
N-terminal helix. While each model has its merits, it 
is also conceivable that packing forces contribute to 
the formation of dimeric species during crystalliza- 
tion and these may not be physiologically signifi- 
cant. Our preferred explanation is that NS5A binds 
a variety of cellular targets and these dimers 
explain some of the multiple interactions of NS5A. 
Additional mutational and structural studies are 
needed to elucidate the precise details of the NS5A/ 
inhibitor complexes. 

The discovery of our new NS5A dimers suggests 
that the formation of an extended multimeric net- 
work of NS5A may occur through the utilization of 
various dimer interfaces. The combination of the two 
genotype lb NS5A-D1 dimers allowed the formation 
of a superhelical array that provides a model for the 
oligomerization of NS5A and NS5A-D1 constructs 
observed in vitro. However, the formation of this 
higher order oligomer into a non-planar, superhelical 
array precludes its association with a lipid bilayer.^^ 
By combining the different conformations of the new 
NS5A-D1 dimers present in our crystal structure, 
one can envisage an array of NS5A molecules that 
forms a more expansive network that could interact 
with the membranous web (Supporting Information 
Fig. S3). NMR experiments of NS5A constructs have 
shown the protein can reversibly aggregate in a con- 
centration dependent manner (data not shown), pos- 
sibly into ordered oligomeric states. Such complexes 
could imaginably have roles in the formation of the 
membranous web, the replication complex, and/or 
the assembly complex on lipid droplets. 

As S04^~ and the phosphate from an RNA back- 
bone have similar conformations and charge, it could 
potentially mimic the natural RNA ligand of NS5A. 
Interestingly, while the sulfate bound within our 
crystal structure performs a stabilizing role in crys- 
tal formation, a structural alignment of monomer A, 
which coordinates the S04^~ via residues Arg 41 
and Arg 81, with the Tellinghuisen et al. genotype 
lb structure places the anion in their proposed 
RNA-binding groove^^ (Fig. 6). Alignments with 
other NS5A domain 1 dimers do not place the S04^~ 
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in any recognizable RNA binding groove. Assuming 
these to represent physiological RNA binding sites, 
it is possible to accommodate the approximate posi- 
tion of viral RNA (7rU) in the proposed RNA- 
binding groove of the Tellinghuisen et al. structure^^ 
suggesting this "arched" dimer may be relevant for 
RNA binding by NS5A in the HCV life cycle (Fig. 6). 

Another possible role for this binding site could 
be to sequester the phosphorylated C-terminus of 
NS5A. Phosphorylation of NS5A is thought to regu- 
late its interaction with a variety of host and viral 
binding partners, hence regulating HCV replication 
and assembly. Therefore binding of a phosphorylated 
residue within the C-terminal would alter NS5A 
specificity for target host proteins and progression 
through the viral life cycle. This could explain how 
the targeting of NS5A inhibitors to NS5A-D1 could 
affect phosphorylation in domains 2 and 3, and halt 
the transition from viral RNA replication to virus 
assembly.^^ Recently, the hyperphosphorylation of 
NS5A in domains 2 and 3 has been shown to be 
dependent on the phosphorylation of at least one 
residues in NS5A domain.^" This regulation of NS5A 
domains 2 and 3 by domain 1 could explain findings 
that have determined that DCV expresses dual 
MOAs: interfering with the roles of NS5A in both 
viral RNA synthesis and virion assembly,*^ with the 
latter being an offshoot of NS5A-D1 dimer destabili- 
zation/stabilization. 

Our current understand the mode of action for 
NS5A inhibitors centers on the disruption of the for- 
mation and function of the replication complex.^^ It 
is thought that the compound-interaction site on the 
surface of NS5A is exposed only transiently during 
the HCV life cycle, which explains why interactions 
with NS5A have only been detected by treatment of 
whole replicons with compounds and not in 
vitro.^^'^^ The binding of inhibitors to NS5A perhaps 
during replication complex formation, could result in 
conformational changes to NS5A and the subsequent 
formation of non-functional replication complexes. 
The isolation of a NS5A dimer via chemical cross- 
linking of photoaffinity DCV analogs, provided the 
first direct evidence of compounds interacting with 
the N-terminal region of NS5A^^ and emphasized 
the importance of dimerization of NS5A in the HCV 
life cycle and its inhibition. The structural distortion 
of NS5A by the DCV analogs may result in inhibi- 
tion of the replication complexes through concerted 
communication through NS5A multimers.**^ The 
NS5A domain 1 allows a variety of multimerized 
NS5A complexes to be built into an NS5A network. 
We suggest that all of the observed NS5A dimers 
could be physiologically relevant and involved in the 
formation of this network. Within this proposed net- 
work, at least one dimer mode would be the target 
of these dimeric DCV-like inhibitors leading to net- 
work disruption and malfunction of the replication 



complexes. As NS5A has multiple roles and NS5A 
inhibitors affect viral RNA synthesis and virion 
assembly,*^ we suggest this proposed NS5A network 
also serves a purpose in viral assembly. The new 
dimer modes observed in the genotype la NS5A 
structure not only extends our knowledge of interac- 
tional surfaces that could play a role in the action of 
symmetrical NS5A inhibitors, but supports the 
notion that NS5A dimers play a critical physiological 
role in the viral life cycle. 

Materials and Methods 

Cloning and purification of 1a NS5A D1 

Recombinant la (isolate H77) NS5A (residues 33- 
202) with a N-terminal TEV protease cleavage site 
(ENLYFQGSM) was expressed using the pET32 
plasmid in BL21(DE3) Escherichia coli (Novagen) at 
15°C for 16 h. Cells were harvested and resuspended 
in a wash buffer consisting of 100 mM Tris-HCl (pH 
8.0), 300 mM NaCl, 10 mM imidazole, 10% glycerol, 
in addition with the protease inhibitors aprotinin at 
2 |.ig/mL, leupeptin at 10 \iM, pepstatin at 1 \iM and 
phenylmethanesulfonylfluoride at 1 mM (Melford). 
The cell lysate supernatant was loaded onto a 5 mL 
HisTrap FF Crude (GE Healthcare) affinity column 
and washed with 100 ml of the wash buffer and 
eluted using the same buffer with 500 mM imidaz- 
ole. The concentration of fractions containing the 
fusion protein was determined using a NanoDrop 
1000 (Thermo Scientific) and the fractions were 
pooled and desalted into the wash buffer. One milli- 
gram of TEV protease (produced in-house*^) per 100 
mg of fusion protein was added for overnight cleav- 
age at 20° C. After TEV protease cleavage, the His- 
thioredoxin-tag and TEV protease were removed 
from the cleaved sample by passing it through a sec- 
ond 5 mL HisTrap FF Crude (GE Healthcare) and 
collecting the flow-through and wash fractions, 
which were pooled and concentrated before injection 
onto a S75 column (GE Healthcare) equilibrated 
with a crystallization buffer consisting of 20 mM 
Tris HCl, 150 mM NaCl, 10% glycerol, pH 8.0, 5 mM 
dithiothreitol (DTT). 

Crystallization of la NS5A D1 

Protein was concentrated to 8 mg/mL as determined 
with a NanoDrop 1000 (Thermo Scientific). To 
remove any precipitation, samples were subject to 
centrifugation at 16,060 g for 2 min. Initial hits 
were obtained from sparse matrix screens using sit- 
ting drop plates with 100 nL reservoir added to 100 
nL protein with a mosquito crystallization robot 
(TTP LabTech) before sealing and incubation of the 
plates at 4°C. The largest crystals were grown using 
microseeding with the crystals from the original con- 
dition containing 0.1 M MES pH 6.5, 1.6 M MgS04 
which were crushed and used as a seeding stock for 
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Table 11. Summary ofNSSA Domain 1 Crystal Structure Dimer Properties 



Distance between the centers Buried (solvent-accessible 

Dimer of mass of the monomers (A) surface) area ( A^) Rotation angle (° ) 

Tellinghuisen lb 25.0 1022.1 178.1 

Love lb 21.5 957.1 180.0 

laAB 23.1 435.1 179.8 

la CD 48.5 315.0 173.7 



an optimization plate consisting of sitting drops of 2 
|iL of the protein sample with 2 |.iL reservoir. After 
30 days incubation at 4°C, crystals were visible and 
grew to a full size of up to approximately 150 |.im 
after 60 days. 

X-ray data collection and structure 
determination 

Crystal hits were fished out of the solution in 0.1 to 
0.2 mm cryoloops (Hampton), before being immersed 
briefly in a buffer consisting of O.IM MES pH 6.5, 
1.6 M MgS04 and 30% glycerol, and then flash fro- 
zen in liquid nitrogen. Native data sets were col- 
lected using a PILATUS 6M detector at the micro 
focus beamline 124 at Diamond Light Source. Diffrac- 
tion data was processed with xia2 (Winter, 2009) and 
phases were calculated with Phaser** using the 
genotype lb NS5A domain 1 structure (PDB code: 
IZHl) as a search model. The process of refinement 
using Phenix.refine with NCS restraints*^ and model 
building was iterated. The model was validated using 
Molprobity and deposited within the PDB as 4CL1.*'' 
Data collection and final refinement statistics are 
given in Table I. Buried solvent accessible surface 
area, rotation angles and distances between centers 
of masses of the dimers were calculated with Chi- 
mera and are summarized in Table 11.*^ Amino acid 
conservation at dimer contact sites was determined 
by 95% conservation from 25 sequences from geno- 
types 1 to 6 from the Los Alamos database.** Figures 
were made using PyMOL (The PyMOL Molecular 
Graphics System, Version 1.5.0.4 Schrodinger, LLC). 

DCV docliing and molecular dynamics (MD) 

The head-to-head CD-dimer from our structure was 
prepared for docking and molecular dynamics (MD) 
studies using the Schrodinger Protein Prep Wizard*^ 
script. This script adds missing amino acid side 
chains and optimally adds hydrogen atoms into the 
structure. The cleft between the CD-dimer was iden- 
tified as a potential binding site and Glide*^ from 
the Schrodinger suite of programs was used to ini- 
tially dock DCV into this pocket. The resulting 
docked structure of DCV was hand-adjusted to 
enhance the symmetry of the CD-dimer/DCV com- 
plex. The NMR structure of the N-terminal alpha 
helices (residues 1-31,^^ IRTC.pdb) was grafted onto 
each monomer of the CD-dimer/DCV complex. The 



missing residues (32-35) were modeled into place 
and restraints added to enhance symmetry of the 
final modeled structure. 

Molecular simulation details 

The CD-dimer/DCV complex was placed in a 57.13 
X 129.52 X 50.40 A^ box with 9917 TIP3P^° water 
molecules and eight chloride ions. The MD simula- 
tion was carried out with NAMD.^^ The 
Amber99SB^^ force field was used to model the pro- 
tein, solvent, and ions, while the General Amber^^ 
force field was used for DCV. The Verlet integration 
scheme was used with a multi-time step^* of 2/2/6 
and Shake^^ was applied to all hydrogen bonds. The 
non-bonded interactions were tapered between 8 and 
10 A and particle-mesh-Ewald (PME)^® was used for 
long range electrostatics. The simulation was run 
with periodic boundary conditions in a NPT ensem- 
ble at 300 K and 1 atm. The solvated complex was 
first subjected to 250 steps of protein backbone- 
constrained conjugate gradient minimized followed 
by an additional 250 steps of unconstrained conju- 
gate gradient minimization. This was followed by 
MD heating (10 K increments every 100 steps up to 
300 K, NVT ensemble,) and equilibration (250 ps, 
NPT ensemble). The MD equilibrated system was 
subject to 40 nanoseconds (nsec) of MD and 40,000 
frames, one frame per picoseconds (psec), were col- 
lected for data analysis. 
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